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Abstract: 

The spike (S) protein of the recently emerged human coronavirus (MERS-CoV) mediates 
infection by binding to the cellular receptor dipeptidyl peptidase 4 (DPP4). Here we mapped the 
receptor-binding domain in the S protein to a 231-amino acid fragment (residues 358-588) by 
evaluating the interaction of spike truncation variants with receptor expressing cells and soluble 
DPP4. Antibodies to this domain - much less so to the preceding N-terminal region - efficiently 


neutralize MERS-CoV infection. 


Text 
Just 10 years following the outbreak of the severe respiratory acute syndrome coronavirus 
(SARS-CoV) the world is confronted with yet another deadly human coronavirus. The virus, first 
provisionally called human coronavirus-EMC (hCoV-EMC) (1, 2) but now named MERS-CoV, 
referring to its emergence in the Middle-East and to the respiratory syndrome it causes, 
belongs to the betacoronavirus genus lineage 2c (3). As of June 7" 2013, 55 cases have been 
laboratory confirmed including 31 deaths, all from - or linked to - the Arabian Peninsula (4). Like 
with SARS-CoV, patients affected by MERS-CoV suffer from severe and often lethal lower 
respiratory tract infection. The epidemiology of MERS-CoV is still enigmatic, but the 
geographical distribution of epidemiologically unlinked individuals points to intermittent, 
zoonotic transmission from a — so far unknown - animal source, whereas a number of reported 
clusters indicate limited human-to-human spread (5). 

The main determinant of coronavirus tropism is the viral spike (S) protein as it mediates 


binding to a cell-surface receptor. The MERS-CoV S protein, a 1353 amino acid type | 
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membrane glycoprotein, assembles into trimers that constitute the spikes or peplomers on the 


surface of the enveloped coronavirus particle. The protein combines the two essential entry 


functions, namely that of host receptor binding and membrane fusion, which are attributed to 


the N-terminal (S1, residues 1-751) and C-terminal (S2, residues 752-1353) half of the S 


protein, respectively (Fig.1a). Recently we have identified dipeptidyl peptidase 4 (DPP4, also 


known as CD26), expressed in the human lung, as a functional receptor for MERS-CoV(6). 


Importantly, MERS-CoV can also use the evolutionary conserved DPP4 of other species, most 


notably that of bats(6, 7). 


Coronaviruses bind to receptors via independently folded, generally about 150-300 


residues long receptor binding domains (RBD) present in their S1 subunit, of which the location 


within S1 can vary (8-10). Thus, for the betacoronavirus mouse hepatitis virus (MHV) the 


binding to its CEACAM receptor (11) has been mapped to the N-terminal ~300 amino acids of 


the spike protein (12, 13) whereas for the SARS-CoV - of the same genus - binding to the ACE2 


receptor (14) maps to residues 323-502 of S1 (15, 16) (Fig.1a). Identification of the RBD can 


hence help the development of monoclonal antibodies and vaccines for the treatment and 


prevention of infection. The RBD is the most important target for neutralizing antibodies (12, 17, 


18) preventing virus-receptor interaction. 


We previously used the S1 domain of MERS-CoV fused to the Fc-region of human IgG to 


demonstrate the interaction of S1 with DPP4-expressing cells and with soluble, i.e. non 


membrane-anchored DPP4 (6). To identify the receptor binding domain in the MERS-CoV $1 


subunit, we generated S1-Fc protein chimera’s with truncations at the C-terminus and 


N-terminus of the S1 domain. We considered a three domain structure of the MERS-CoV $1 
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protein (residues 1-357, 358-588 and 589-747) based on the predicted location and structure of 


the RBD of two other betacoronaviruses, MHV and SARS-CoV (12, 13, 15, 16), of which the 


homologous regions for MERS-CoV S map to the residues 18-351 and 379-580, respectively 


(Fig.1b). In addition, a soluble form of human DPP4 (residues 39-766) was made, which was 


C-terminally tagged with the Fc region. These proteins were expressed in HEK-293T cells after 


transfection of the respective expression plasmids and subsequently affinity-purified from the 


cell culture supernatant using protein A sepharose beads as described(6). The Fc region of 


purified sDPP4-Fc was proteolytically removed using trypsin (data not shown). First, we 


analyzed the S1-Fc proteins and C-terminal S1 truncations thereof for their ability to interact 


with sDPP4 using a co-purification assay. sSDPP4 was efficiently co-purified by the S1-Fc 


variants encompassing residues 1-588 and 1-747 whereas the 1-357 S1-Fc variant was unable 


to bind sDPP4 (Fig.2a). We next generated an S1-Fc variant comprising residues 358-588, a 


region homologous to the ACE2 receptor binding domain in SARS-CoV $1 (Fig.2a). This S1-Fc 


truncation variant efficiently bound soluble DPP4, indicating that the DPP4 receptor binding 


domain is located within the 358-588 residues domain of the MERS-CoV spike protein. 


We subsequently tested the ability of these S1-Fc variants to bind to HEK-293T cells 


transiently expressing DPP4 by using flow cytometry. The S1-Fc variants encompassing 


residues 1-588 and 358-588 bound to DPP4-expressing HEK-293T cells with efficiencies 


comparable to the full-length S1 protein whereas no binding was observed with the 1-357 


$1-Fc variant (Fig.2b). These data show the 358-588 amino acids S1 region to be essential and 


sufficient for binding to DPP4-expressing cells, consistent with the results of the sDPP4 


interaction study. 
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To confirm the observed interactions in a more biological assay we analyzed the ability of 


the S1-Fc variants to prevent MERS-CoV infection. Thus, Huh-7 cells were preincubated with 


the different S1-Fc variants before being inoculated with the MERS-CoV. We found that the 


variants encompassing residues 1-747 and 358-588, but not the 1-357 S1-Fc variant, inhibited 


infection (Fig.2c). 


Finally polyclonal antibodies were raised in rabbits against the 1-747, 1-357 and 358-588 


$1-Fe variants (Davids Biotechnology GmbH, Germany). The sera, which displayed equal 


ELISA titers towards its antigen (1:300.000, data not shown), were tested for their ability to 


neutralize virus infectivity. Antibodies raised against the 358-588 S1-Fc variant efficiently 


neutralized virus infectivity, superior to those raised against the 1-747 and 1-357 S1-Fc variants 


(Fig.2d). This indicates that neutralizing epitopes within S1 are primarily localized to the RBD 


region. The elicited antibodies are likely to block the interaction of the spike protein with DPP4 


thereby neutralizing MERS-CoV infectivity. Of note, antibodies raised against the MERS-CoV-S 


RBD did not cross-neutralize SARS-CoV infection (data not shown). The results demonstrate the 


potential of S1 protein and of the 358-588 S1 polypeptide as subunit vaccines with a high 


biosafety profile compared to vaccines based on inactivated viruses or live-attenuated virus. 


Except for the betacoronavirus MHV, which binds to its CEACAM receptor through a 


domain in the N-terminal part of its S1 protein, the RBDs of all other coronaviruses that engage 


protein receptors and that have been mapped occur in the C-terminal portion of the S1 subunit 


(Fig.3). Examples also include the a/phacoronaviruses binding to ACE2 (hCoV-NL63) and APN 


(e.g. TGEV, hCoV-229E) (10, 19-25). In this study we have experimentally mapped the RBD of 


MERS-CoV to a 231-amino acid fragment (residues 358-588) within the spike protein. This 
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domain nicely corresponds with the S1 region recently anticipated to interact with the DPP4 


receptor on the basis of theoretical S1 structure predictions (26). The RBD in the MERS-CoV 


$1 protein localizes in the same region where the SARS-CoV S protein interacts with its ACE2 


receptor (26). The SARS-CoV RBD structure displays a 5-stranded B-sheet core structure (61-4 


and B7) maintaining the overall domain conformation, and a long extended loop containing two 


anti-parallel B-sheets (85 and 86) responsible for receptor binding(16). Intriguingly, compared 


to SARS-CoV, the RBD of MERS-CoV contains a relatively conserved core domain but a highly 


variable loop region, tentatively explaining the differential receptor usage(26). Crystallization 


and structure analysis of this MERS-CoV RBD region in complex with DPP4 will give detailed 


insight into the virus-receptor binding interface. 


Figure legends 


Figure 1. Receptor binding domains in betacoronavirus spike proteins and S1-Fc expression 


constructs. a, Schematic representation of the betacoronaviruses SARS-CoV, hCoV-EMC S$ 


and MHV (strain A59) spike (S) protein sequence (drawn to scale) aligned at the S1-S2 junction. 


The known receptor binding domain in the S1 subunit of MHV and SARS-CoV S proteins and 


their corresponding homologous regions in hCoV-EMC S as defined by ClustalW alignment are 


indicated. Positions of the transmembrane domain (yellow bar; predicted by the TMHMM server) 


and of the predicted N-glycosylation sites (WY; predicted by the NetNGlyc server, only shown for 


the hCoV-EMC S) are indicated. The border between the S1 and S2 subunits of the spike 


protein is represented by a vertical white line. b, upper panel, schematic presentation of the 


hCoV-EMC $1 subunit (residues 1-751) sequence. Cysteine positions in S1 subunit are 
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indicated by vertical white lines with corresponding amino acid positions on top. Positions of 
cysteines highly conserved among betacoronaviruses S1 proteins are in bold. Predicted 
disulfide bond connections inferred from the structure of the SARS-CoV receptor binding 
domain are presented as connecting black lines underneath. Lower panel, domains of the 


hCoV-EMC $1 subunit expressed as Fc chimera’s. 


Figure 2. The DPP4 binding domain is located within residues 358-588 of the hCoV-EMC spike 
protein and efficiently elicits neutralizing antibodies. a, S1-Fc chimeric proteins and soluble 
DPP4 (sDPP4) receptor were expressed from HEK-293T cells and purified from the culture 
supernatant. S1-Fc proteinswere mixed with sDPP4 followed by protein A sepharose affinity 
isolation, analyzed on a NoVEX® 4-12% Tris-Glycine gradient gel under non-reducing 
conditions, and stained with GelCodeBlue reagent. Position of the S1-Fc proteins - running as 
dimers under non-reducing conditions due to an Fc interchain disulphide bond - and sDPP4 as 
well as the sizes of the marker proteins are indicated. Individual proteins were loaded as 
controls. b, Binding of hCoV-EMC S1-Fc proteins to DPP4 expressing cells. 2.5x10° HEK-293T 
cells transfected with control pbCAGGS (grey shaded area) or with pCAGGS-DPP4 (black line) 
expression plasmid were incubated with 15 ug/ml of the indicated S1-Fc followed by incubation 
with DyLight488 labeled goat-anti-human IgG antibody and analysis by flow cytometry. An 
Fc-chimera containing the S1 of infectious bronchitis virus (IBV-S1-Fc) was taken along as a 
negative control. c, Inhibition of hCoV-EMC infection by S1-Fc 1-747, 1-357 and 358-588 
variants. Huh7 cells were preincubated with 40 ug/ml S1-Fc 1-747, 1-357 or 358-588 for 0.5 h 


prior to virus inoculation (1 h), all at RT. Mock incubated cells (control) and cells incubated with 
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a DPP4 polyclonal antibody (anti-DPP4) were taken along as controls. Following incubation at 


37° C for 8 hours, infected cells were detected by immunofluorescence and infection was 


quantified (relative to control). The experiment was carried out twice and the data of one a 


representative experiment are shown. Error bars indicate standard error of the mean. d, 


Neutralization of hCoV-EMC infection by rabbit antisera raised against the S1-Fc 1-747, 1-357 


and 358-588 variants. Virus (200 pfu) was premixed 1:1 with serial dilutions of sera obtained 


(open bars) or after immunization (closed bars) prior to inoculation onto VERO cells and virus 


infection was monitored by the occurrence of CPE at 72 hours post infection. Virus 


neutralization titers (VNT) were determined in quadruplicate as the highest serum dilutions that 


completely prevent CPE. The experiment was carried out twice and the data of one a 


representative experiment are shown. Error bars indicate standard error of the mean. 


Figure 3. Localization of receptor-binding domains in coronavirus spike proteins. Schematic 


presentation of the spike proteins of the a/johacoronaviruses TGEV and hCoV-NL63 and of the 


betacoronaviruses SARS-CoV, hCoV-EMC and MHV (drawn to scale), aligned at the S1-S2 


junction. Blue boxes represent the receptor-binding domains (RBD) and indicate the engaged 


receptor. The RBD of TGEV, hCoV-NL63, SARS-CoV and MHV have been confirmed by 


crystallography (12, 15, 22, 26). Grey boxes indicate the transmembrane domain. Sequence 


IDs: TGEV (ABG89335.1), hCoV-NL63 (NC_005831.2), SARS-CoV (NP_828851.1), 


hCoV-EMC (AFS88936.1), MHV (NC_001846.1). 
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